
perf(topn): reduce unnecessary table scan in streaming (Group)TopN executors #13832

Merged: 16 commits into main from rc/fix-group-topn-too-many-table-scan on Dec 13, 2023

Conversation

@stdrc (Member) commented on Dec 6, 2023

I hereby agree to the terms of the RisingWave Labs, Inc. Contributor License Agreement.

What's changed and what's your intention?

According to #13797 and some further investigation, when using a temporal filter with a short time period, the high part of the TopN cache is very likely to be frequently deleted until it becomes empty. And because TopN executors (actually the managed TopN state) blindly scan the state table to refill the high cache, performance can be significantly impacted.

This PR maintains a row_count in the managed TopN state, so that when the cache is empty, the table scan can be skipped if the managed state knows the cache is still in sync with the table (i.e. the row counts match).

Fixes #13797.
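
As an illustration of the idea, here is a minimal, self-contained Rust sketch. SketchStateTable, ManagedTopNStateSketch, and the key/payload types are hypothetical stand-ins, not the actual types; the real changes live in src/stream/src/executor/top_n/top_n_state.rs and the TopN cache code.

// Hedged sketch only, not the actual ManagedTopNState.
use std::collections::BTreeMap;

#[derive(Default)]
struct SketchStateTable {
    rows: BTreeMap<i64, String>, // (order key, payload)
}

struct ManagedTopNStateSketch {
    state_table: SketchStateTable,
    /// Number of rows in the state table, or `None` while it is unknown
    /// (e.g. for a freshly seen group, before a complete table scan).
    row_count: Option<usize>,
}

impl ManagedTopNStateSketch {
    fn insert(&mut self, key: i64, payload: String) {
        self.state_table.rows.insert(key, payload);
        if let Some(cnt) = self.row_count.as_mut() {
            *cnt += 1; // keep the count in sync with the table
        }
    }

    fn delete(&mut self, key: i64) {
        self.state_table.rows.remove(&key);
        if let Some(cnt) = self.row_count.as_mut() {
            *cnt -= 1;
        }
    }
}

fn main() {
    let mut state = ManagedTopNStateSketch {
        state_table: SketchStateTable::default(),
        row_count: Some(0), // known to be empty in this toy example
    };
    state.insert(1, "a".into());
    state.insert(2, "b".into());
    state.delete(1);
    assert_eq!(state.row_count, Some(1)); // the count tracks the table
}

With this bookkeeping, the managed state can tell when the in-memory cache already mirrors the whole table and skip the refilling scan, which is exactly the case the temporal-filter workload hits over and over.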

Checklist

  • I have written necessary rustdoc comments
  • I have added necessary unit tests and integration tests
  • I have added test labels as necessary. See details.
  • I have added fuzzing tests or opened an issue to track them. (Optional, recommended for new SQL features Sqlsmith: Sql feature generation #7934).
  • My PR contains breaking changes. (If it deprecates some features, please create a tracking issue to remove them in the future).
  • All checks passed in ./risedev check (or alias, ./risedev c)
  • My PR changes performance-critical code. (Please run macro/micro-benchmarks and show the results.)
  • My PR contains critical fixes that are necessary to be merged into the latest release. (Please check out the details)

Documentation

  • My PR needs documentation updates. (Please use the Release note section below to summarize the impact on users)

Release note

If this PR includes changes that directly affect users or other significant modifications relevant to the community, kindly draft a release note to provide a concise summary of these changes. Please prioritize highlighting the impact these changes will have on users.

cache.insert(topn_row.cache_key, (&topn_row.row).into());
if cache.len() == cache_size_limit {
table_row_count = None; // cache becomes full, we cannot get precise table row count this time
A contributor commented on this code:

I think we should check this equality before the insert, so we can know that the number of records in Hummock is strictly larger than the cache size. Like the following:

if cache.len() == cache_size_limit {
    table_row_count = None; 
    break;
}
cache.insert(topn_row.cache_key, (&topn_row.row).into());

stdrc (Member, Author) replied:

Wait a minute, I'm still working on this. Not ready😁

stdrc (Member, Author) replied:

I don't quite get it. When fill_high_cache is called, the high cache is never full, so this part of the code just inserts into the cache until it reaches the size limit.
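
For readers following along, here is a hedged, self-contained sketch of the kind of fill loop being discussed. cache, cache_size_limit, and table_row_count mirror the snippet above; fill_high_cache_sketch and the key/row types are made up for illustration, and the real fill_high_cache in top_n_state.rs is more involved.

use std::collections::BTreeMap;

// Returns Some(count) only if the table iterator was fully consumed before
// the cache filled up; otherwise the precise count is unknown.
fn fill_high_cache_sketch(
    table_rows: impl Iterator<Item = (u64, String)>, // hypothetical (cache key, row)
    cache: &mut BTreeMap<u64, String>,
    cache_size_limit: usize,
) -> Option<usize> {
    let mut table_row_count = Some(0usize);
    for (cache_key, row) in table_rows {
        if let Some(cnt) = table_row_count.as_mut() {
            *cnt += 1; // count every row read from the table
        }
        cache.insert(cache_key, row);
        if cache.len() == cache_size_limit {
            // The cache filled up before the iterator was exhausted, so the
            // exact table row count cannot be known from this scan.
            table_row_count = None;
            break;
        }
    }
    table_row_count
}

fn main() {
    let mut cache = BTreeMap::new();
    let rows = (0..2u64).map(|k| (k, format!("row{}", k)));
    // Only 2 rows with a limit of 16: the scan completes, so the count is exact.
    assert_eq!(fill_high_cache_sketch(rows, &mut cache, 16), Some(2));
}

This is only meant to illustrate why the counted value can be trusted only when the scan finishes before the cache reaches its size limit.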

}

pub fn delete(&mut self, value: impl Row) {
self.state_table.delete(value);
if let Some(row_count) = self.row_count.as_mut() {
A contributor commented on this code:

Can we ensure that the deleted key must exist in Hummock?

stdrc (Member, Author) replied:

I believe we now have the state table delete sanity check, right?

@stdrc marked this pull request as ready for review on December 7, 2023 07:42
@@ -325,19 +351,20 @@ impl TopNCacheTrait for TopNCache<false> {
&& (self.offset == 0 || cache_key > *self.low.last_key_value().unwrap().0)
{
// The row is in mid
self.middle.remove(&cache_key);
A member commented on this code:

Why move this?

stdrc (Member, Author) replied:

We need to keep the entries in the cache consistent with the table.

@xxchan (Member) left a review comment:

The idea sounds OK to me. We may want to test whether it actually works.

@xxchan (Member) commented on Dec 8, 2023

Did you test whether the perf issue is resolved by this?

@stdrc (Member, Author) commented on Dec 8, 2023

> Did you test whether the perf issue is resolved by this?

Will run a test soon

The codecov bot commented on Dec 8, 2023

Codecov Report

Attention: 10 lines in your changes are missing coverage. Please review.

Comparison is base (c74817f) 68.06% compared to head (30e76b7) 68.05%.
Report is 5 commits behind head on main.

Files                                                   Patch %   Missing
src/stream/src/executor/top_n/top_n_state.rs            82.35%    6 lines ⚠️
...tream/src/executor/top_n/group_top_n_appendonly.rs   0.00%     3 lines ⚠️
src/stream/src/executor/top_n/group_top_n.rs            66.66%    1 line ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main   #13832      +/-   ##
==========================================
- Coverage   68.06%   68.05%   -0.02%     
==========================================
  Files        1535     1535              
  Lines      264826   264874      +48     
==========================================
+ Hits       180266   180269       +3     
- Misses      84560    84605      +45     
Flag Coverage Δ
rust 68.05% <87.95%> (-0.02%) ⬇️


@stdrc (Member, Author) commented on Dec 11, 2023

(Screenshots: 2023-12-11 13:15:40, 2023-12-11 13:16:13)

The throughput doesn't change much, but the bloom filter false-positive rate does decrease compared to #13797.

cc @st1page @Little-Wallace

@Little-Wallace (Contributor) commented

> The throughput doesn't change much, but the bloom filter false-positive rate does decrease compared to #13797.

Thanks for your work. Could you run the test again? #13558 has been merged and it may reduce some IO operations.

@stdrc (Member, Author) commented on Dec 11, 2023

> The throughput doesn't change much, but the bloom filter false-positive rate does decrease compared to #13797.
>
> Thanks for your work. Could you run the test again? #13558 has been merged and it may reduce some IO operations.

https://buildkite.com/risingwave-test/nexmark-benchmark/builds/2649#018c597f-5684-4b84-a7db-eddfc753f995

Something went wrong...

(Screenshots: 2023-12-12 01:36:13, 2023-12-12 01:36:41)

@stdrc (Member, Author) commented on Dec 13, 2023

(Screenshot: 2023-12-13 11:12:12)

Now FPR drops to near zero.

@st1page (Contributor) commented on Dec 13, 2023

> The throughput doesn't change much, but the bloom filter false-positive rate does decrease compared to #13797.
>
> Thanks for your work. Could you run the test again? #13558 has been merged and it may reduce some IO operations.
>
> https://buildkite.com/risingwave-test/nexmark-benchmark/builds/2649#018c597f-5684-4b84-a7db-eddfc753f995
>
> Something went wrong...
>
> (Screenshots: 2023-12-12 01:36:13, 2023-12-12 01:36:41)

The barriers pile up because at the beginning the source emits insert operations, and after 5 minutes the temporal filter starts emitting delete operations. Then performance degrades and the source executors are backpressured, but the temporal filter cannot be backpressured, so it keeps emitting delete messages.
Is #13271 included in your branch? If not, we can merge main and take a look.

@stdrc (Member, Author) commented on Dec 13, 2023

> Is #13271 included in your branch? If not, we can merge main and take a look.

It is included🥵

@Little-Wallace (Contributor) commented

Although the throughput does not increase much, this PR still reduces Hummock IOPS and the FPR:
(Screenshot)

@st1page (Contributor) commented on Dec 13, 2023

The low performance might be because of #13968.

@st1page (Contributor) left a review comment:

Let's merge the PR first because it actually solves part of the performance issue.

@stdrc enabled auto-merge on December 13, 2023 08:34
@stdrc added this pull request to the merge queue on Dec 13, 2023
Merged via the queue into main with commit 8db6191 Dec 13, 2023
26 of 27 checks passed
@stdrc deleted the rc/fix-group-topn-too-many-table-scan branch on December 13, 2023 09:22
@fuyufjh (Member) commented on Dec 14, 2023

Hey @st1page @stdrc, thanks for the work! Which query are you testing with? I'm wondering how much it solves the performance issue.

@stdrc (Member, Author) commented on Dec 15, 2023

> Which query are you testing with?

Nexmark q9-temporal-filter. The main contribution of this PR was to reduce table iter ops.

@fuyufjh (Member) commented on Dec 15, 2023

> Which query are you testing with?
>
> Nexmark q9-temporal-filter. The main contribution of this PR was to reduce table iter ops.

Can you share the effect? For example, how much were table iter ops reduced after this PR?

@stdrc (Member, Author) commented on Dec 15, 2023

> Which query are you testing with?
>
> Nexmark q9-temporal-filter. The main contribution of this PR was to reduce table iter ops.
>
> Can you share the effect? For example, how much were table iter ops reduced after this PR?

As originally reported in #13797, the bloom filter false-positive rate was high (up to 50% most of the time), which means nearly half of the table iter operations were wasted.

After this PR, the FPR is near 0% most of the time and around 20% during warm-up (many groups are new, so the manually maintained in-executor table_row_counts don't help yet). So we think this PR did solve one aspect of the issue.

However, throughput was still not improved as expected. As @st1page found, another problem (possibly the true IO bottleneck) is that GroupTopN doesn't fetch missing groups concurrently (#13968), which likely prevents batching at the storage layer.
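
To make the skip-scan decision concrete, here is a small hedged sketch; need_table_scan, row_count, and rows_in_cache are illustrative names rather than the actual API.

// Hedged sketch of the decision described above; illustrative names only.
// `row_count` is the per-group count maintained by this PR, `rows_in_cache`
// is the number of rows currently held in the TopN cache.
fn need_table_scan(row_count: Option<usize>, rows_in_cache: usize) -> bool {
    match row_count {
        // Freshly seen group (warm-up): table size unknown, must scan.
        None => true,
        // The cache already mirrors the whole table: nothing new to fetch.
        Some(n) if n == rows_in_cache => false,
        // The table holds more rows than the cache: scan to refill.
        Some(_) => true,
    }
}

fn main() {
    assert!(need_table_scan(None, 0));     // warm-up: scan
    assert!(!need_table_scan(Some(3), 3)); // in sync: skip the scan
    assert!(need_table_scan(Some(10), 3)); // table has more rows: scan
}

This matches the behavior described above: during warm-up the count is unknown and the scans (and thus the bloom-filter lookups) still happen, while in steady state the synced count lets most of them be skipped.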

wenym1 pushed a commit that referenced this pull request Dec 25, 2023
wenym1 pushed a commit that referenced this pull request Dec 25, 2023
wenym1 added a commit that referenced this pull request Dec 25, 2023

Successfully merging this pull request may close these issues.

perf(streaming): optimize topN cache